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Effects of Instructional Differences among Ability Groups 
on Student Achievement ia Middle-School Science and Mathematics 



ABSTRACT 

This paper analyzes the effects of ability grouping on middle school math and science achievement, 
attempting to account for these effects in terms of measurable classroom experiences of students. The 
authors hypotljesize that grouping effects operate through classroom instructional differences. Data 
collected from a national sample of public school students and teachers reveal large effects of group 
placement on 8th-to-9th grade achievement growth, and substantial track differences appear for 
several instructional variables. Adding the instructional variables to the achievement models shows 
that 75 percent of the high-ability group effect on science achievement, but only 17 to 33 percent of 
the mathematics track effects are accounted for by the instructional variables. The largest 
instructional effects on mathematics achievement are associated with emphases on problem solving 
and understanding principles (as opposed to computational mechanics), the pacing of the course, and 
the use of lectures to present the material. The largest effects on science achievement are found for 
measures of "inquiry-based" instruction, the use of lectures and small-group projects, and student 
assessments of the accessibility of the subject matter. 
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Effects of Instructional Differences among Ability Groups 
on Student Achievement in Middle-School Science and Mathematics 



What distinguishes "low groups" from "high groups" in ability-grouped schools? Most 
research on grouping has focused on differences in learning outcomes, and usually finds that high- 
group students learn much more than low-group students. A large part of the difference is due to the 
more advantaged backgrounds of higher group students, but several studies have concluded that some 
of the difference is due to group membership itself. As is true of many other types of group 
differences, survey researchers have not had much success in accounting for the effects of ability 
grouping. Presumably, the level of a group affects classroom processes, and these classroom 
differences contribute to inequality in achievement. If the relevant classroom processes cannot be 
identified, though, then one might well be sceptical of the original claim for grouping effects on 
learning, since the unexplained effects could amount to nothing more than uimieasured selection 
factors. 

Current policy interest in the consequences of ability grouping give a clear practical 
importance to these issues. Ability grouping in the middle and upper secondary grades is increasingly 
viewed as a structure producing inequalities in student educational outcomes. Some writers emphasize 
the need to find alternatives (Oafces and Lipton, 1992), while others call for improving the way ability 
grouping is implemented (Gamoran, 1991), Given the current widespread use of ability grouping, it 
is essential to document hov/ its impact occurs, to help guide future reforms. 

This study addresses the issue of the instructional consequences of ability grouping with 
recently collected data on mathematics and science education in American middle schools, Tlie 
analysis makes three contributions to current research on the effects of ability grouping. First, studies 
of the use and consequences of grouping at the middle schooE level are rare, and attention to that level 
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should contribute to a greater understanding of secondary school tracking, a subject ^vhich has 
received much more attention, A second contribution is that we analyze grouping within specific 
subjects and across classrooms, a distinctive and arguably more appropriate focus than the somewhat 
amorphous across-subjects-and-classrooms design of most work on secondary school tracking. Third, 
this study makes use of information about the conduct of classes collected from students and their 
teachers to assess the effects of factors hypothesized - but rarely tested - to account for the effects of 
group placement. 

RESEARCH ON THE EFFECTS OF TRACKING ON ACHIEVEMENT 
The conceptual model that guides our research is schematically represented in Figure 1. The 
diagram shows individual student achievement growth (a one year time period is analyzed here) to be 
most directly affected by learning opportunities provided in courses. We distinguish two general 
types of opportunity variables. The first, shown on die right-hand side of the diagram, consists of 
instructional variables: curricular objectives or emphases, time use ("quantity"*), and presentational 
methods ("quality"). The second type consists of what are more properly considered "resource" 
variables, or what cla:;ses are given by the schools which support them. These resources include the 
background and expectations of the teacher assigned to teach the student's class, as well as factors 
such as class enrollment size, access to laboratory facilities, and the amount of time scheduled for 
classes to meet. We hypothesize that both types of opportunities are in turn affected by the ability 
level Or track of the class. Finally, Figure I shows that student background variables affect ability 
group placement. For present purposes, we consider background to include variables determined 
prior to the start of the achievement growth period (fall semester of the 8th grade in our analysis) 
which affect growth. Background thus encompasses social -demographic variables such as gender, 
race^ethnicity, and socioeconomic status; family socialization variables such as parental expectations 
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for school success and fiirther study; and student academic baclcground variables such as initial levels 
of academic interest, achic^vement, and grades. 

Our review of prior research suggests that the hypothetical model shown in Figure 1 is 
problematic in three respects. First, there is some question over whether ability group placement 
really has any effects on student achievement. Second, student experiences and learning opportunities 
have been shown to vary with ability group rank, but it is not clear how much these dififerences 
reflect real effects cf group rank as opposed to pre-existing differences among students. Third, the 
links between the kinds of classroom-based learning opportunities that observers have focused on and 
student achievement are largely unexamined by research using sample surveys and correlational 
methods. 

Track Effects on Learning 

For present purposes, we consider effects of ability grouping or tracking on achievement to 
consist of outcome differentials between groups that cannot be accounted for by student background 
differences. While our analytic focus is on ability groups within panicular subject domains, 
specifically eighth-grade mathematics and science, tlie genera! model shown in Figure 1 is also 
applicable to high school tracks that cut across subjects and extend over the entire high school grade 
span. Most studies of curriculum tracking in secondary schools have found that inequality of 
achievement across tracks increases over time among students with similar academic and social 
backgrounds (see Gamoran and Berends, 1987, for a review). These studies mainly relied on global 
indicators of track positions, such as academic, general, and vocational programs, usually drawn from 
student reports, A limitation of this type of measure ii; that in many schools, students are grouped by 
performance into different classes within such tracks, and furthermore many schools do not have such 
broad programs at all, though they still sort students by ability for specific subjects (Oakes, 1985). 



studies of ability grouping within subjects have yielded mixed results, with some showing 
increasing inequality, but others not. In a review, Slavin (1990) concluded that the average effects of 
ability grouping versus heterogeneous assignments on ^achievement in secondary schools is zero, and 
the so-calltxl "di^erential effects" that appear in some studies - whereby high-ability students appear 
to benefit while low-ability students are harmed - reflect selection bias, that is, uncontrolled 
differences among students assigned to different groups. Because students are assigned to ability 
groups on the basis of past and expected future performance in school, what appears to be increasing 
inequality may simply reflect U)e impact of pre-existing differences. 

Without minimizing the importance of addressing selection bias, another interpretation of the 
mixed results which Slavin (1990) underscored is also possible. Most of the studies that Slavin 
reviewed were small-scale and of limited duration. It may be that grouping has different effects under 
different circumstances; for example, in one school, students at all levels may receive the same 
instruction, while in another, high-group students may have opportunities not granted to other 
students. Differing results may thus reflect varied implementation, rather than measurement error. 
Hoffer (1992a) found that the effects of middle-school ability grouping do in fact vary from school io 
school, but stopped short of giving an explanation of that variability. Testing an elaborated model of 
between-school variation in the effects of high school uracking, Gamoran (1992) found that the effects 
of academic track placement on math and verbal achievement varied between high schools, being 
larger in schools characterized by lower levels of track mobility and smaller relative academic urack 
enrollments. 

The inconsistencies in the numerous small-scale studit« Slavin (1990) reviewed thus may 
reflect variability in the ways grouping is implemented, but it would still be useful to have a summary 
picture of grouping effects based on a national sample. Unfortunately, national survey data on 
subject-specific ability grouping have not been available for the United Spates until quite recently. 



Drawing on a panel study of national sample of students which began in 1987, Hoffer (1992b) found 
consistent evidence for differential effects in middle-school science and mathematics using several 
different methods of controlling for pre-existing differences. These results compared quite closely 
with KerckhofFs (1986) fmdings from an analysis of a large national sample of British students, 
Kerckhoff (1986) reported that between ages 11 and 16, students assigned to high-ability classes 
gained, while students assigned to low-ability and remedial classes fell furth^ and further behind. 
These findings held despite a variety of controls for pte-existing differences among students assigned 
to different types of classes, 

li y ck Effect Mech.inisms 

^liile evidence of track effects on achievement is found in several :tudies» the mechanisms 
generating the eiTects ai e less well documented (Oakes, Gajnoran, & Page, 1992, p, 599). Die most 
widely-cited study of tracking in .©cent years (Oakes, 1985) presented a v;ealth of useful descriptive 
inf^rmatior along with pnvocative i^tarpretatioris of how tracking is implemented, experienced, and 
understood by itudents and teachers, but did not pursue quantitative analyses of tracking effects on 
student oivtcomes and their n-^chanisms. On the basis ct Oakes' (1935) and other descriptive accounts 
of U3ck differ<;^nces in classroom activities, Gamoran (1989) argued that variation across tracks in the 
quality and quantity of instrucdon was likely the most powerful mechanism through which tracking 
producBai unequal achieveniiat. In the present study, we examine variation across tracks in three 
aspects of students' opportunities for learning: instructional objectives, quantity of instruction, and 
quaiity of instniccion* 

Instu ictional objectives refer to the emphases on specific types of cognitive and affective 
outcomes reflected in curriculum and instruction, Oakes (1985, ch, 4) showed that lower-track 
classes emphasized rote learning of formulaic knowledge ("basic skills"), while conceptualization, 

5 ' 

8 



ERIC 



BEST COPY AVAIUBLE 



problem solving, and (in English classes) creative thiiiking were relatively more important in higher- 
track classes. Teachers' socialization goals also differed: in low-track classes teachers stressed 
obedience (" learning to follow rules") whereas they focused more on nmturing self-direction in 
higher-track classes (Oakes, 1985, pp, 79-90). 

Our fu:st hypothesis, then, is that track differences in achievement result in part from 
variation in teachers* educational goals. Of course, these goals must be somehow conununicated to 
students or translated into action if diey are to affect student outcomes. For example, an emphasis on 
problem-solving implies providing opportunities for students to respond to complex questions instead 
of simple drills. Because we are frequently unable to ol:";ep/e the activities undertaken to accomplish 
instructional goals, we specify direct effects from objectives to outcomRS in Figure 1, allowing the 
objeaives to serve ^s proxies for their behavioral manifestations in the analyses. 

A second factor tliat could account for learning differences is the quantity of instruction , 
which includes "time on task" as well as ^'conf'vnt coverage" as discussed in previous studies (e.g., 
Denham and Lieberman, 1980; Barr and Dreeben, 1983; Kilgore, 1993). In some cases quantity is 
measured as the amount of time allocated to instruction (e.g., the number to hours a math class meets 
over the course of a semester or year). Such conditions usually do not vary across classes within 
schools, so they are unlikely to account for ability-group differences in achievement. More subtle 
indicators of instructional quantity are applied time variables (e.g., time spent covering new material 
as opposed to administrative or evaluation tasks, the amount of homework assigned, etc.). These 
conditions may well vary among different types of classes within a school. Cakes (1985, pp. 98-102) 
found that high-track teachers report spending a somewhat larger proportion of classroom time on 
instruction as opposed to discipline, routines, and socializing. High-track teachers also expect their 
students to spend more time on homework (Cakes, 1985, pp. 102-103). Other writers have noted that 



high*track classes proceed more rapidly through the curriculum, covering more academic material in 
the process (see Gamoran & Berends> 1987, for a review). 

Again, we do not find in the research literature direct tests of the explanatory power of these 
fectors with regard to leafniag differences among high- and low-ability group filass^. Extending the 
"quantity of instruction" hypothesis to an analysis of high school track differences, Gamoran (1987) 
found that academic track students complete more coursework in mathematics and that this could 
account for about half of the estimated track effect on math achievement. Differential coursetaking 
did not account for track effects in language arts> however, leading Gamoran to speculate that 
differences in the quality of instruction are more influential for reading comprehension and writing 
skills. 

A third way in which classes and track? may differ is the Quality of instruction . Quality of 
instruction refers to the manner m which material is presented. This incorporates what Nystrand and 
Gamoran (1991) refer to as "instructional discourse," and includes variables such as tlie coherence of 
lessons, tlie teacher's degree of follow up on student questions and answers, and the extent of 
interactive engagement of ideas (discussion) as opposed to one-way transmission of knowledge and 
passive reception* Several recent writers have argued that students learn more when they are 
intellectually challenged and when their ideas are treated as serious and valuable in their own rights 
(e,g„ Nystrand and Gamoran, 1991; Newmann, 1992), Kilgore (1993) conceptualizes instructional 
quality in similar terms, but includes an additional dimension of "accassibiSity," or the degree to 
which the difficulty of the materials is appropriately matched to the students' initial level of mastery. 

Oakes (1985, pp. 105-112) also argues that the quality of instruction differs among ability 
grouped classes. The dimensions of instructional quality she examined included teacher verbal 
clarity, goal-directedness, and punitiveness; student reports were used to measure the constructs. 
While the sizes of the differences were not reported, Oakes (1985, pp. \07-l09) noted that high-track 



students gave their teachers significantly higher ratings on these dimensions than their low-group 
couaterparts. Gamoran and Mystrand (1992) reported that high-ability English classes were 
characterized by more emphasis on discussion and more opesi-ended questions about literature than 
average or low-ability classes. 

Why might one expect ability groups to differ along the lines of instructional objectives, ' 
quantity, and quality? Our principal hypothesis is that these kinds of learning opportunities are likely 
to be affected by the student's ability group placement, but it is useful to consider the larger issue of 
why opportunity' differences may arise. One explanation of these differences in instruction is that 
teachers' and students' ex pectations of what is appropriate differs across tracks. TTie basic idea here 
is that placement in a given track carries with it a certain status and set of peer influences which 
affect students' attitudes and efforts. Meyer (1980, p. 30) suggests, in &ct, that the institution of 
tracking is legitimized to the extent that students in different tracta come to believe that they are 
essentially "different" and act accordingly. While none of the analyses of national survey data have 
beeii able to address this factor directly (see however Ganioran, 1986 for an indirect approach), 
several of the ethnographic studies have emphasized the social-psychological eftiects of track 
placement. Oakes (1985) noted that lower-track students tend to see themselves - and are seen by 
higher-track students - as less intelligent and able to learn. Other observers have argued that lower 
tracks are also characterized by anti-school norms which pressure students toward low levels of 
engagement and even disruptive behavior. This pattern of witlidrawal and rebellion has generally 
been interpreted as a defsnse mechanism against the stigma of being labeled as less intelligent. 

Teachers may adjust their teaching style in response to these differences in student 
orientations and behaviors. Some studies suggest that teachers of lower-track classes place fewer or 
lower demands on their students, not because they do riot want to challenge tlie students, but because 
the students simply refuse to do challenging assignments (Powell, Farrar, and Cohen, 1985). 
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Teachers* low expectations for student achievement in that case are grounded in practical realities. 
Other studies have argued that teachers could make greater demands but do not^ because they believe 
the students either are incapable of higher levels of performance, or do not need higher levels of skills 
and knowledge for their putative destinations (Oakes, 1985). 

A second explanation is essentially a "scarce resource" argument. The main variant of this is 
that the quality of instruction is tied to tlie teachers themselves. This conception of teachers and 
teaching is a static one, wherein teachers have certain talents and shortcomings, and their teaching 
behavior flows directiy from their characteristics. This view contrasts with the more dynamic view, 
just discussed, of teachers responding to their classes. Whatever the merits of the hypothesis that 
teacher background has a significant impact may be, some studies have found these background 
characteristics vary between tracks. Analyzing a uational sample of high school teachers, Talbert 
(1990) found that about one-third of all teachers teach mostly classes at a single ability level. Another 
recent national study of secondary-level math and science teachers showed that teachers of high-track 
classes are more experienced and better qualified iu their subjects than low-track teachers in the same 
schools (Oakes, 1990, pp. 62-^7). Rosenbaum's (1976) case snidy showed that teachers were 
allocated primarily to one track, and that teach, ^s reputed to be the best were assigned to the higher 
track classes. Rosenbaum's (1976) and Finley*S (1984) case studies noted that the higher track 
teachers spent more time and effort prep<u:ing for classes, and worked harder in class to challenge the 
students. 

While we will not pursue these hyt^otheses in an exhaustive manner here, we will examine 
some indicators of teacher expectations and professional backgrounds in a preliminary effort to shed 
some light on the main findings of our analyses. 
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DATA BASE AND MEASURES 
The data we analyze were collected by the Longitudinal Study of American Youth (LS AY) 
from fell 1987 to fall 1989. The LSAY is a six-year panel study of middle and high school science 
and math education v/hich began in fell 1987. The base year samples consisted of 3,1 16 seventh 
graders and 2,829 10th graders drawn from 51 pairs of middle or junior high schools and senior high 
schools. The schools were systematically selected with probability proportional to enrollment size 
within twelve sampling strata defined by region of the country (four categories) and type of 
community (niral> suburban, urban). Target samples of sixty students at both grade levels were then 
randomly selected within the sampled schools. In the fall of each year, students are administered 
achievement tests in math and science consisting of items drawn from the National Assessment of 
Educational Progress (NAEP) and a questionnaire asking about background and attitudes. In the 
spring of each year, the science and math teachers of each student in the sample are mailed 
questionnaires asking about various aspects of the specific classes in which the sampled students were 
enrolled. In spring 1988, all of the math and science teachers in the participating schools were also 
mailed a questionnaire asking about their professional backgrounds and characteristics of the their 
schools. The LSAY also conducts a telephone interview with an alternating parent of each student in 
the study each spring, collecting additional information on students' family backgrounds.^ 

Analysis Samples 

The data analyzed here are drawn from the middle school student instruments and the spring 
1989 teacher questionnaires. The models we estimate for mathematics and science achievement differ 
in some details, and the analysis sample compositions vary as a result. The math achievement 

^ The LSAY data and codebooka are available for public use at a nominal fee. 
For niore information/ contact the project at the following address: LShY , Social 
Science Research Instiitiute, Northern Illinois Universit^y, DeKalb, IL 60115. 
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subsample consists of 1,346 younger cohort (originally seventh grade) students who met the following 
criteria: (1) During tlie 1988-1989 school year, the student was still enrolled in one of the 46 original 
LSAY schools (out of the 51 total original schools) using ability grouping for eighth grade 
mathematics instruction; (2) the student completed the fall 1987 and spring 1989 questloimaires, and 
the fall 1989 science and mathematics achievement test; (3) the student's eighth grade math teacher 
completed the spring 19S9 class questionnaire; and (4) the eighth grade math teacher completed the 
spring 1988 teacher background and attitude questionnaire. Students who trar^ferred out of their 
original school were excluded because the LSAY did not collect ability group or teacher-class data tor 
those respondents, even though they continued to complete tests and questiomiaires. Students enrolled 
in the five original schools which did not use ability grouping were excluded because the the small 
number of cases precluded any strong conclusions. 

The science achievement subsample consists of 1,235 students meeting the following criteria: 
(1) the student was still enrolled in one of die original LSAY schools during the 1988-1989 school 
year, regardless of whether the school used ability grouping in eighth grade science; (2) the student 
completed the fall 1987 and spring 1989 questionnaires, and the fall 1989 science and mathematics 
achievement test; (3) the student's eighth grade science teacher completed the spring 1989 class 
questionnaire; the eighth grade math teacher completed the spring 1988 teacher background and 
attitude questionnaire; and (5) the student was not in a low- or remedial-level science class in eighth 
grade. We added the last restriction because after the other criteria were met, the sample included 
only 35 low-group students. Since this was too small a number on which to base any inferences, and 
since they are much different from average-group students on many measures, we decided to exclude 
them rather than collapse thein into the average group 

^ Hoffer'B (1992b) analysis of ability grouping effects ueing ttie LSAY data 
included the nongrouped students in mathematics and the low-group students in 
science. His analysis samples were larger than oursr because he did not use the 
teac tier -class or teacher-background data used here. 
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While the numbers in the matfa and science analysis samples are close, only about 900 
students are included in both subsamples. The reason is that a relatively higher proportion of students 
were excluded from the science subsample because of science teacher nonresponse to the class 
questionnaire. Despite following identical administration procedures, the LSAY realized response 
rates of .71 for the science and .76 for the math teachac-class questionnaires in spring 1989, Thus, 
while some students were excluded from the math analysis because they were in the small number of 
schools that did not use tracking in dghth grade math, this number was roughly matched by the loss 
of cases from tlie science analysis due to greater teacher nonresponse. 

The selection criteria taken together resulted in a sizable loss of students and a potential 
sample selection bias in the estimation of effects. We address this problem by including an 
adjustment term in our achievement equations, which in effect controls for uimieasured factors 
associated with the probability of exclusion from the sample and with achievement (BerJc, 1983).^ 

Measures 

Acadmig achievement 

Appendix A contains descriptions of the measures used in the analysis, including names of the 
variables as they appear in the LSAY public-use files. Cognitive achievement in science and 
mathematics are measured here as composite scores summarizing student performance in the different 
domains tapped by the NAEP-derived LSAY tests. The scores were estimated by Item Response 

' In addition to the analyses presented in Tables 3 and 4, we estimated a 
number of other regression equations to test the robustness of the results. By 
selecting the analysis subsamples in terms of the presence of different 
combinations of available data, the approximate case bases can range as high as 
H»2,200 for some the estimates of ability group effects on achievement (** Model 
1** in Tables 3 and 4), N-l,800 for estimates of resource effects ("Model 2"), and 
N=sl,700 for estimates of instruction effects ("Model 3"). The results and 
conclusions presented in the present paper were not appreciably altered by the 
alternative specifications. The main difference was that the lower case base 
usad in Tables 3 and 4 results in lower levels of statistical significance and 
thus more conservative estimates. 
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Theory (IRT) methods, whereby individual test items are first calibrated and Uien Students are scored 
according to their response patterns. The scores were scaled so that the average score of the seventh 
graders was equal to SO with a standard deviation of 10 {see Miller, et ai, 1992 and Suchner, 1988 
and 1989 for detailed discussion of the LSAY test construction procedures). 

The LSAY public use fdes include achievement scores with imputed values for cases with 
missing data. We used these imputed values fbr cases with missing data on the controls for prior 
achievement (the 1987 and 1988 science and math tests), but not for the ultimate dependent variables 
(the 1989 tests). The imputations were made with Uie "nearest neighbor hot deck" method described 
by Little and Rubin (1986). Cases with missing data ("recipients**) were matched with complete cases 
("donors") according to the students' patterns of course work in science and math, and their 
completed achievement tests. Imputed values for the recipients were then obtained by randomly 
drawing a single donor from the pool of matched donors. Approximately 4 percent of the cases in our 
analysis subsamples had one or more imputed scores on the measures of prior achievement. 

Abitiry group placement 

The ability levels of the students* mathematics and science classes are measured primarily 
from master schedules collected each year from the sampled schools. Ttie students provide 
information on their courses and teachers each semester, and diis information is matched with the 
school schedules to code the courses and teachers. In most cases it was clear from the schedule 
whether the school used ability grouping and, if so, the levels of each class that semester. If the 
schedule was not clear, or if teachers provided contradiaory Infomation about the level of the class 
in their questionnaire responses, phone calls to the school were made to clarify the situation. 
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Student academic anu sn ciai backgroun d 

Student background is represented by five variables: prior academic performance, SES, 
student gender, race-ethnicity, and parental expectations for success in mathematics and science. In 
addition to the achievement measures discussed above, we used students* self-reported grades as 
additional controls for prior academic performance. As with the measures of prior achievement, 
some imputed values weie calculated for cases with missing grade data. Here we used a simple OLS 
regression method, whereby grades were regressed on prior achievement and social background 
variables, and the regression coefficients then used to calculate predicted grade values for cases with 
missing information. About 10 percent of the cases have imputed data from this method. 

We constructed a composite SES measure by combining information from indices of parental 
education, occupation, and household possessions. The measures of parental expectations for math 
and science education were constructed by summing students* baseyear (fall 1987) responses to two 
sets ^one for math and another for science) of three checklist items asking about whether their parents 
(1) "... have always encouraged me to work hard on math [science]," (2) "... expect me to do well in 
math (science]," and (3) think math [science] is important/ Student gender and race-ethnicity 
were identified from student questionnaire items administered in the base year. Missing data on 
gender were coded by a check on student names; calls to the schools were made for cases with 
ambiguous names. Missing data on race-ethnicity were filled in by using school records if available. 

Class resource variables 

Class resources refer to variables that are allocated to classes either by direct administrative 
decisions or by more diffuse cultural mechanisms. They do not lefer directly to what transpires in 
classes, but they are likely to affect classroom activities and outcome. For present purposes, we 
include two types of variables under this general heading: teacher and class characteristics. Our 
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measures of teacher characteristics are limited, but are consistent with prior research and current 
policy discussions. One set of measures includes the teacher's professional background, indicated by 
edugational background and years of teachtng^xperience . These variables are measured by teacher 
self-reports collected in the LSAY 1988 teacher background survey and subsequently merged to the 
student records on a clas5*by*class basis. Background indicators include whether the teacher majored 
in tlie Subject (math or science), the number of college semester-equivalent courses the teachers 
completed in their subject and in science or mathematics education courses, and the nur.iber of years 
the teachiTS have taught their subject (science or mathematics). 

A second set of teacher characteristics includes indices of teacher expectatiotus. We rely on 
student reports for these measures. The student questionnaires administered in the spring semester of 
the eighth grade asked the students to indicate agreement or disagreement with several propositions 
about their math and science teachers. These questions focused on the teachers' encouragement and 
expectations. We conducted exploratory factor analyses of the student responses to class-related 
questions to determine if a smaller set of underlying factors could summarize the larger sets of related 
indicators. Principal components analyses found that the students* responses to the questions about 
their mathematics and science classes were patterned very comparably in both subjects. Three factors 
emerged from the battery: teacher academic_push , teacher effectiveness , and teacher*s career push 
(see Appendix A for a list of the specific items used in each scale). Although we used factor analysis 
10 help detect coherent instructional indicators^ we constructed the scales as unweighted additive 
composites. 

Class chaiacteristics are measured by three indicators from the spring 1989 science and math 
teacher class questionnaires. For both subjects, we use the teacher's report of the number of students 
enrolled the class, and the number of minutes per week the class met. For science, we also use an 
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indicator of whether the course had an additional laboratory class, and if so» how many minutes of 
laboratory the class had each week. 

Instructional variables 

We drew our measures of classroom instniction from the spring 1989 surveys of the students' 
math and science teachers, and from the fall 198S and spring 19S9 student luestionnaires. The math 
and science teacher surveys collected data on the mathematics and science classes of sibout 70 percent 
of the sampled students. The questions focused on class objectives, teaching methods, and classroom 
time use. 

Instructional objectives . Our measures of instructional objectives are taken from the teacher 
questionnaires. The teachers were asked about the relative emphasis they accorded each of several 
general objectives, rating each from 1 =no emphasis to 4=heavy emphasis. Factor analyses of these 
batteries found that only two raultiple-item scale emerged from the teacher reports on instructional 
goals. From the science teachers' questionnaire, we identified an ^^phasis on inquiry learning factor 
measured by five items, and an emphasis_on science and society measured by two other questions 
from the same battery. Both scales were constructed as unweighted averages of the component items. 
We also used one single-item indicator of objectives from this battery, a question asking how much 
emphasis "^Teaching science facts and principles" received. 

Our measures of instructional objectives in mathematics classes consist of single-item 
indicators, since no interpretable scales emerged from preliminary factor analyses. These items (see 
the Appendix for specific wordings) generally follow the same lines as the science class inquiry scale, 
measuring the instructional emphasis on "basic," in this case, computational, skills and "higher 
order, ** problem-solving skills. 
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Instructional quantity. The measures of instructional quantity are also drawn mainty from the 
teacher questionnaires, but are supplemented with some student reports. Teacher reports of textbook 
coverage, the percentage of class time devoted to covering new material, and the average amount of 
homework they assigned each week are the primary measures. We supplemeru these with student 
reports of their absenteeism over the school year (from the spring questionnaire) and the amount of 
homework they had in the class over the year (an average of the student's fall and spring reports). 
Absenteeism is included here as a control variable, since it is associated with ability group placement, 
instructional quantity, and learning outcomes. 

Instructional quality. Instructional quality is measured with teacher and student reports. The 
teachers were asked to indicate the hours per week they devote to various kinds of instructional 
activities, including teacher-directed interactions with the class (lectures and discussions) versus small 
group work and the individualized activities of seatwork, one-on-one help, and computer tutorials. 

We also use two measures of instructional quality from the student questionnaires. The spring 
1989 student questionnaires asked the students to grade their math and science classes in terms of how 
much they like the subject matter, the teacher's clarity, the intellectual challenge, the textbook's 
clarity, and the overall difficulty. Only one multiple-indicator factor emerged from a principal 
components analysis of the student course evaluation questions. This factor is reasonably 
characterized as the intellectual accessibility of the course (the constituent items are listed in the 
appendbc). We also use a single-item indicator of the intellectual challenge of the course, taken from 
the same battery: "How much does this course challenge you to use your mind?* 



STATISTICAL MODELS 
Our main hypothesis argues that the effects of group placement are mediated by class resource 
and instructional variables. Testing this hypothesis presupposes that group placement indeed has an 
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effect on student achievement^ and our first task is thus to specify and estimate a model of group 
effects. As the discussion of analysis samples indicated, our group effects model differs somewhat 
for mathematics and science. This is because virtuaiiy all of the schools in the LSAY sample use 
ability grouping in eighth grade math but only about half of the schools use ability grouping in eighth 
grade science. For mathematics, the group effects hypothesis can be formally represented by the 
following regression model: 

Math Achievement, = + iSt(Background)i + iS2(High Group)i + 

iS^CLow Group)i + (1) 
where the dependeikt variable is student i's level of achievement at the beginning of ninth grade, 
Background includes multiple measures of prior achievement, grades in schooU and social 
background; and High Group and Low Group are dummy variables indicating the student's ability 
group placement. The omitted category against which the effects of the group placement dummies is 
interpreted is the middle ability group. 

Group effects on science achievement are estimtited for all students (except the small number 
of students enrolled in low tracks) whether or not enrolled ia a grouped school. These estimates are 
obtained by adding an additional term to represent the stiidenis attending schools not using ability 
grouping in eighth grade Science: 

Science Achievementi - ofi + iSi(Background)| + iS2(High Group)i + 

i33(Nongrouped)i + (2) 
The omitted reference category fcr the dummy variables is thus the average-level students attending 
schools using ability grouping in eighth grade Science, As discuss',d earlier, we excluded low-track 
students from the science analyses because of the very small sample size for this group in the LSAY 
(n==35, after other data availability restrictions were applied). 
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The theoretical and empirical research on tracking suggests that the effects of the class 
resource and instruction variables on achievement can be estimated with simple additive elaborations 
of equations (1) and (2); 

Math Achievementi - + /Si(Background)i + jSi(High Group)i + 
/?3(Low Group)i + /J4(Class Resources)} + 

/Jj(Instniction), 4- C31 (3) 

Science Achievementj = ^4 + /?i(Background)i + /Ja(High Group); + 
^3(Nongrouped)i + A(Class Resources)i + 

/S,(Instruction)i + (4) 

Our analyses also address two other important sources of error that much of the prior work 
on grouping effects has ignored. The first miglit be called "treatment selection bias," and it concerns 
the estimates of the group placement effects in equations (1) and (2). Campbell and Erienbacher 
(1970) criticized correlational analyzes of the effects of "high" versus "low" program placements on 
achievement growth for relying too heavily on single pretest controls for initial differences among 
students The problem is that measurement error on the pretest leads to inflated estimates of (positive) 
high-group and (negative) low-group effects. This claim could be assessed by including an adjustment 
for pretest unreliabElity into the model. Unfortunately, it is usually not possible to determine exacdy 
the reliability of the pretest (and other covariates). The main alternatives for making this adjustment 
are to use published test reliability estimates or to use multiple indicators of tiie unobserved "true 
score/ Since the former strategy is not well sutcwl to panel models witn repeated measures (Jencks, 
1985), we rely on the latter strategy. Thus for both the science and mathematics models, we include 
seventh and eighth grade math and science test scores plus students* self-reported grades as controls 
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for prior differences, in addition to the standard social backgi'ound variables. Although no set of 
controls can eliminate treatment selection effects widi certainty, our controls are more extensive than 
those in most previous non-experimental work on tracking. In other studies with equally extensive 
sets of controls, Gamoran and Mare (1989) and Hoffer (1992b) showed that by including multiple 
protest scores, estimates of bias due to nonrandom selection to tracks could be minimized. 

A second potential source of error in this analysis is tied to the problems of nonresponse 
among students and dieir teachers. This problem can be thought of as one of "sample selection bias." 
Analyses not presented here sho'V that nonresponse is not random with respect to the variables in the 
models, being more likely for lower-achieving and lower-SES smdenis. Precisely how the loss of 
the^e cases affects the estimated effects of the variables in the model cannot be determined a priori , 
and over the past decade analysts have developed special techniques for addressing the problem of 
potential biases (cf Berk, 1983 for a discussion). These methods essentially involve estimating two 
equations^ a "selection" model and an "outcome" modd. The selection model is specified for all 
students. It gives estimates of the effects of the independent variables on the likelihood of being 
excluded from the analysis subsample, and these estimates are then used to assign each respondent a 
predicted probability of being included in the analysis. The outcome model is estimated for the subset 
of cases that has complete data. It has the form shown in equations (1) through (4) but also includes 
the predicted probabilities from the first model. One is thus able to determine whether the likelihood 
of exclusion is related to the outfiome^ and to control for the effects of whatever correlation that 
likelihood has with the other independent variables. 

If the selection equation were a linear regression with the same independent variables as the 
outcome equation, the outcome equation would not be identified. Identification is ordinarily achieved 
through either or both of two methods: use of a nonlinear logistic or probit estimator for the selection 
equation^ and exclusion from the outcome equation of one or more predictors in the selection 
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equation. We use both mediods here, using the logistic estimator for the selection model, and adding 
indicators of school location (dummy indices of suburban and rural versus urban) to the selection 
model but not to the outcoine model. Urban location proves to be strongly associated with 
nonresponse iii the LSAY data, yet location has virtually no independent effect on achievement and 
can thus be excluded from the outcome model.** 

FJESULTS 

Grouping. Cognitive Growth, and Instruction 
The first problem we address is essentially descriptive in nature, and concerns the extent to 
which ability groups differ on the indicators of classroom functioning available in the LSAY data. To 
set die context of these comparisons, it is useful to consider first the magnitude of the student 
achievement and background differences among ability groups. The upper panel of Table I (science 
subsample) and Table 2 (mathematics subsample) shews that average achievement scores diff&r greatly 

between ability groups, but change little from seventit to ninth grade. The differences between the 

I 

high and the low groups in mathematics range from 1 A to 1 .7 pooled standard deviation units at each 
grade level, and the high-average group differences in both subjects are about half of those 
magnitudes. Most of the group differences found at the beginning of the ninth grade are thus already 



* In addition to the school location dummies, the science subsample 
selection equation included the following measures (see Appendix A for 
desciptions ) : Female, SES, Blackr Hispanic, 19 87 Science Score, 1988 Science 
Score, 1986 Cortipoeite Grades i 1987 Science Graded , and 7th Grade; Science Clas? 
Ability Group Placemeat (dummies for highr middle , and lowr where nongrouped 
students are the omitted reference category). The science equation was estimated 
for 2,310 students. The model fit statistics showed that 60% of the cases had 
predicted prob2U>ilities of exclusion which agreed (when rounded to zero or one) 
with their actual status. 

The matiheinatics subsample equation included rural and suburban location 
duxnenies. Female, SGS, Black, Hispanic/ 1987 Mat:h Score, 1988 Math Scorer 1986 
Composite Grades r 1987 Hath Grades, and 7th Grade Math Class Ability Group 
Placement (dummies for high, middle r and low, where nongrouped student e are the 
omitted reference category) . The model fit statistics were somewhat better than 
for science: 65% of the 2,212 cases were predicted correctly. 
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in place at the beginning of seventh grade» but grouping may still be associated with a growing 
inequality of what students learn. 

The second panels in Tables 1 and 2 show the distribution of student social background 
variables by ability group. We include these variables in our analyses primarily as controls for 
differences among students which are established prior to ability group placement and thus classroom 
instniction. Nonetheless, it is worth bearing in mind that the social compositions of classes are 
important organizational outcomes in their own right, irrespective of whatever achievement 
differences nnay be found after controlling for background. Consistent with findings from studies of 
school records and many other local and national surveys, these figures show that higher tracks enroll 
disproportionate numbers of students from higher SES lilies, and enroll disproportionately lower 
numbers of African-American and Hispanic youth. The measures of parental expectations for 
academic success also favor higher-trade students. As Gamoran and Mare (1989) observed in an 
earlier national data set» gender differences among tracks are small and favor girls for higher-group 
placement With respect to social class and race-ethnicity, though, it is clear that one consequence of 
ability grouping is to increase the segregation of students across classes, particularly in the 
mathematics curriculum. 

The next panels in Tables 1 and 2 give a picmre of how schools allocate instructional time, 
students, and teachers to the different ability groups. Schools assign teachers with stronger 
educational backgrounds to die higher ability groups in mathematics but not in science. The LSAY 
data show substantial differences in college majors for math teachers of different ability groups: 64 
percent of high-group math teachers were math majors compared to only 43 and 46 percent in the 
remedial and average classes, respectively. More of the average-group math teachers have college 
minors in math, but the low-group teachers are are no more likely than high -group teachers to have 
minors. Stvidents in remedial math groups thus have teachers which, on average, are well below the 
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levels of formal preparation seen among the teachers of high and middle-group students. In science, 
however, the inequalities are much smaller. Sixty-eight percent of the high group teachers and 6l 
percent of the middle group teachers had college majors in science. Virtually all the science teachers 
had at least a college minor in their subject. 

The teachers' reports on their major field are loosely corroborated by their reports of the 
numbers of undergraduate and graduate courses tiiey hsve taken in science. High-group science 
teachers have not. taken more science courses than those teaching average groups. Nor is thcr^ a 
clear pattern of inequality across ability groups in the science teachers' reports of their coursework in 
secondary science education. The ability group differences are sharper in mathematics, where the 
variability in post-secondary coursework is lower and where there are clear differences among the 
high, middle, and low groups. High-group math teachers also completed more coursework in 
mathematics education than the middle and low-group teachers, who are about equal in that respect. 

The other dimension of teacher background for which we have measures is teaching 
experience, There are no meaningful differences in average years of teaching experience among 
3bilit>- groups apparent in Table 1 or 2. We also examined two other measures of experience which 
are not shown in the tables: (!) total years of teaching experience, regardless of the field now taught, 
and (2) whether the teacher was in his or her first or second versus third or higher year of teaching 
overall. Neither of these indicators showed any differences among ability groups, 

Average- and high-ability groups differ little iu the numbers of students enrolled in classes, 
but remedial math classes are much smaller than the rest (21 students versus 26). In this case, ability 
grouping serves to allocate more resources (teacher time per pupil) to the lower group, despite the 
standardization of regular class instructional time across groups in mathematics. In science, however, 
class sizes are about equal but high group students are much more likely to have an additional 
laboratory session. 
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The other teacher resource variables refer to the more subjective realm of "teacher quality," 
particularly as evaluated by students. Students rate their mathematics teachers somewhat higher than 
their science teachers on all three of the measures used here. Both measures of teacher expectations 
(for academic and career push) show more favorable ratings in the higher ability groups. The other 
dimension we measured, 'teacher effectiveness," also indicates that higher-group students give more 
positive racings to their teachers on the items comprising this scale. Contrary to the image of teachers 
holding low expectations for low-group students^ the high levels of teacher academic push (maximum 
value of 1.0) in all groups mean low^-group students generally report high levels of encouragement 
even though their levels are comparatively lower. 

The lower panels of Tables 1 and 2 show die distributions of teacher and student responses to 
questions about the relative emphases of dieir courses, tiie time allocations in their classes, and 
instnicdonal methods, broken down by ability group. The emphasis questions show that higher track 
science teachers \ '^e greater emphasis on problem solving and the experimental method, but that the 
teachers in all three ability groups report quite high average emphases on these variables. A similar 
pattern holds in mathematics, where we see that problem solving is emphasized more in high track 
classes, at the same time as it is given high priority by remedial and average classes. The reverse 
pattern with similar qualifications holds for the math teachers* emphases on computational skills: 
lower groups place significantly greater emphasis on these skills, but the substantive range on the 
variable is not great. 

The time use variables show only small ability group differences among science classes. 
Higher ability science classes devote slightly less time to discipline problems, slightly more time to 
review, and have a half hour or so more homework per week, but the time spent on routines, new 
material, and testing are roughly equal. The time use panems show stronger track differences in 
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mathematics. Higher ability mathematics classes spend less time on discipline and review, more time 
on covering new material, and the Students have over an hour more homework to do each week. 

The next block of variables refers to instructional methods and how the classes are organized. 
Higher ability science classes use lectures and small group formats more, but are about equal in the 
use of discussions, seatwork, and computerized instruction. Teachers of higher science groups also 
return a greater proportion of the students' homework. Mathematics classes show different patterns 
than science classes on several of these variables. Higher ability math classes also use lectures more, 
but use small group and individualized study formats less. Corrected homework is returned Igss often 
in higher math classes^ pertiaps indicating a tendency to use homework more in class and less as a 
means of social control. 

The fmal variables included in Tables 1 and 2 are more difficult to categorize in terms of the 
means-end schemes apparent in the other classroom variables. These are the more subjective student 
evaluations of the accessibility of the material and the intellectual challenge of the courses. Here we 
find that higher group students in both subjects generally give their teachers and courses more 
favorable ratings. The largest track differences in both subjects appear to be on the assessments of 
the intellectual challenge of the course, but all students consider ttieir science and mathematics courses 
to be very challenging (average ratings of about 4 on a l-to-5 scale). 

In sum, the instruaional variables indicate a number of substantial differences across ability 
groups: Teachers of higher-ability classes (1) are more likely to have majors in their teaching field, 
(2) are viewed by their students as holding higher performance expectations, (3) place greater , 
emphasis on problem solving and conceptualization, (4) devote more time to covering new material 
and less on maintaining discipline, and (S) use whole-group instructional methods (lectures and 
discussions) somewhat more itequently. These descriptive tabulations show some large inequalities 
among classes at different ability levels. Neither the intended nor the unintended consequences of 
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most of these differences are clear, however, and thus one cannot niake a strong case for unfairness 
or fairness on the basis of Table 1 alone. For exampC;;, the distribution of teachers by ttieir 
educational backgrounds favors the higlier tracks, but it is not clear whether teachers with majors are 
any more effective than those without. Additionally, we do not know whether lower-track teachers 
could in faa implement a more problem-oriented curriculum in the face of student apadiy or 
recalcitrance, and, if they could, whether their students would actually actually learn wo^e. If so, 
then the case for untiaimess in the distribution of learning goals and the attendant resources is 
strengthened. A prima facie case for unfairness can be made on some ^ ariables, however: Perhaps 
the clearest examples are the teacher encouragement items. One is haid pressed to think of legitimate 
reasons why teachers should give less encouragement to lower-track students than they give to higher- 
track students. Even here» though, it is imponant to remember that the questions about 
encouragement were asked of the students, and there may be a tendency to give more negative 
appraisals of teachers among students who are academically less successful, independently of the 
teachers' behavior. 



Effects of Group Placement on Learning 
Before turning to the analysis of the effects of the instructional variables on achievement 
growth, it is useful to establish the magnitude of the group placement effects which we are trying to 
explain. The columns labeled "Model 1 ^ in Tables 3 and 4 present the results of regressions of ninth 
grade math and science achievement on the student background and ability group variables. Since the 
eighth grade achievement score is included as an independent variable in Aese models, the effects 
estimated for the social background and ability group variables are interpreted as effects on eighth-to- 
nintli grade achievement change , rather than as effects on the level of ninth grade achievement These 
results show that ability group placement does, in fact, have a strong effect on learning in both 
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subjects. In the mathematics equation, the reference group against which the effects of high- and low- 
group placement are interpreted is the middle-ability group. The estimates show that high-group 
students are 1.8 points above the middle group, while low-group students are 1.9 points below. These 
differences translate into effect size estimates of about .15 of the total sample standard deviation on 
the posttest fbr the one year interval. 

The science achievement model applies to grouped and nongrouped students and thus also 
includes three comparison groups: high track, average track, and nongrouped. The regression 
equation shown in Table 3 uses the average-group students as the omitted reference category, and the 
coefficients on the group placement dummies are interpreted as the effects of being in the high group 
or in a heterogenous instructional setting versus being in the average group in an ability-grouped 
school. The results show that high*group students gain 1 .6 more points than their nongrouped 
counterparts (ES = 1.6/9.5 = .16, where the 9,5 is the nongrouped sample standard deviation on the 
posttest). Nongrouped students, in contrast, show no significant difference from average grouped 
students over the one year period. 

Turning briefly to the other variables in the models (the full regression results are not 
presented here but are available from the authors), the controls for initial achievement clearly 
dominate the statistical results, as one would expect. The measure of sample selectivity ("probability 
of exclusion" in Tables 3 and 4) has a strong association with growth in both math and science, 
indicating that the analytic subsample is in feet different (positively selected with respect to leaiii^ng) 
from the full sample. It is worth noting that the effects of social background are uniformly trivial: 
SES, race-ethnicity, and parental expectations have no significant effects on eighth-to-ninth grade 
achievement growth in either subject. Any direct effects of social background appear to be confmed 
to science achievement: Girls learn somewhat less than boys, and SES has a weak positive effect on 
growth. 
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The main point from Model 1 for present purposes is that ability group placement has effects 
on how much students leam in their eighth grade math and science courses. While these effects 
constitute only the point of departure for the main problem we are addressing, these results are 
noteworthy in light of recent claims about the ineffectiveness of ability grouping (Slavin, 1990). The 
results for mathematics presented here support the claim that grouping has differential effects, 
benefiting high-group students while depressing the achievement of low-group students. The results 
for science also show gains of high group placement relative both to average group students and to 
students in schools not using ability grouping. 

Tracking and Classroom [nstniction 

The next step in the analysis involves the addition of variables describing the classroom 
resources and experiences of students. We divide this analysis into two segments, represented by the 
columns labeled "Model 2" and "Model 3" in Tables 3 and 4. Model 2 includes all of the variables 
from Model 1 plus measures of what we have referred to as resources: teacher educational 
background (whether the teacher majored in the field taught), years of teaching experience in the 
respective field, the numba of students in the class, and the student's reports of the teacher's 
performance expectations and encouragement. In science, we include the presence of a laboratory 
period as an additional resource. Model 3 builds on Model 2 to include the full set of class variables. 

Some of the variables described in Tables I and 2 are not included in the regression analysis. 
Exclusions were made on the basis of preliminary regressions in order to simplify the presentation of 
results. Teacher background variables excluded were the Science and Math Minor dummies, and the 
College Course Credits count measures. Neither showed any relation to achievement, and were 
dropped because they were conceptually redundant with the Science and Math Major variables. We 
also tested for nonlinear effects (diminishing returns) of Teaching Experience, but found that the 
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model fit was better with the simple linear specification. Several of the Instructional Quantity 
variables shown in Tables 1 and 2 were also dropped irom the analysis. The main factor we are 
concerned with in this set is 'time on task," and this is most clearly represented liy the Time on New 
Materials measure. Since Time on Routines, Discipline, Review, and Testing are in this light simply 
alternatives to Time on New Material, we relied on the single indicator. Similarly, we considered the 
teacher-reported Homework Assigned to be conceptually redundant with Ihe student reports of their 
Homework Time, Since the latter is a better measure of quantity, we used it instead. Exclusions 
from the set of Quality measures included Seatwork Time, Individualized Time, and Teaching 
Machine Time. The theoretical concept behind these and the Lecture, Discussion, and Small Group 
Time variables concerns the direct presence or absence of interactions with the teacher. Work in 
small groups represents an intermediate context between teacher-directed and individualized class 
time. We thus excluded the Seatwork, Individualized, and Teaching Machine Time variables since 
we regard them as the alternative implied by the retained measures. 

The results of Model 2 give little support for the hypothesis that inequalities among ability 
groups in the resource variables leads to learning outcome differences* The only statistically 
significant effect of Teacher Background in either subject is for whether the science teach^ majored 
in science. As Table 1 showed, high- and average-ability groups differ slightly on this variable, and 
it thus cannot account for the effect of being in a high group on learning. The small explanatory 
power found in the resource variables is tied to the other indicators. Science and math class size and, 
in science, the use of separate laboratory session, show no significant effects. The teacher 
expectation variables also prove to be only weakly related to student achievement, but two indicators 
have significant effects: Career Push has a positive effect on science achievement, and the students* 
rating of their teachers' effectiveness has a positive effect on math achievement. 
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The results of the Model 3 regression equations show that some instructional variables have 
effects OQ achievement growth, while others do not. A key point to note first is that the addition of 
these variables statistically accounts for most of the estimated effect of ability grouping in science, but 
relatively little of either the high- or low-group effects in math. In science, the effect of high group 
placement decreases from +1.6 to +0.4. The iosignificant difference between nongrouped students 
and average-group students seen in Model 1 changes very litde with the elaborations of Models 2 Ziia 
3. In mathematics, the coefficients for low group placement decrease from -1.9 to -1.6 (17% of the 
effect accounted for), whiie the effects of high group placement shrink from + 1.8 to + 1.2 (33% of 
the effect is explained). 

While the instructional variables do not fully account for the effects of ability group 
placement, several do have effects on the achievement outcomes in the expected directions. For 
science. Model 3 in Table 3 shows that greater emphasis on "inquiry" or experiential learning has a 
positive effect on learning. As Table 1 showed, high track classes are about a fiiU standard deviation 
higher than the average group and heterogeneous classes oti this scale, and this difference is not 
entirely reducible to a simple correlate of student background differences. For reasons beyond 
student backgrouid differences, high track teachers place greater emphasis on the various components 
of this scale: problem solving, experimental logic and design, systematic observation and reporting, 
and mathematical applications in science. The effect on learning, although statistically significant, is 
not enormous: Students in science classes which are a fiitl standard deviation above the sample 
average of the scale are predicted to realize an advantage of only O.OS of a standard deviation unit on 
the science achievement test. At the same time, that does represent about a third of the ^otal effect of 
being in a high cbitity group. It m t also be remembered that the time span of these effect.; is only a 
year (fall of eighth grade to fall of ninth grade), and one which includes a summer at that. 
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Other variables showing statistically sigaificant effects on science growth in Model 3 include 
student absenteeism, the use of small groups, and the studeDt*s rating of the accessibility of the course 
materials. From Table I, high group students have lower absenteeism and their teachers use small 
groups more often, and thus these factors also account for some of the reduction in the estimated 
high-group effect from Model 1 to Model 3. The ratings of course accessibility, in contrast, show no 
effect of track, and the contribution of this factor does not mediate the ability grouping effects. 

The strongest effects on mathematics achievement are associated with emphasis on problem* 
solving, the rate of coverage, and the accessibility of the material. Of these, high-ability classes have 
a much greater emphasis on problem solving than average and low group classes, which tend to be 
about the same. The positive effect of problem solving on achievement thus accounts for some of the 
high-ability group effect, but none of the low-ability group deficit. In contrast, high- and average- 
ability groups are about the same in terms of coverage, but are higher than the remedial group. The 
positive effect of more coverage thus accounts for some of the remedial deficit, but none of tiie high 
group's advantage relative to the average group. The teacher's emphasis on ''facts and principles," 
and the teaclier's greater use of lecture formats, also contribute positively to achievement, whereas 
reported emphasis on computation tends to depress achievement. Of these, the greatest contribution 
to explaining the ability group differences is tied up with the emphasis on computation. In this case, 
Table 2 showed that the three ability groups are ordered in a clear hierarchy of decreasing emphasis 
with increasing ability. This factor thus accounts for both part of the remedial deficit and part of the 
high group advantage. 

Tests for Nonlinear and Multiplicative Effects 
While the models we have estimated are consistent with the conceptual model of Figure 1, 
one might question whether the effects of class resources and instruction fit this assumed linear, 
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additive formuiatica. One alternative hypothesis is that the effects of resources and instruction on 
learning will depend on students* levels of engagement in their coursework: The greater die student's 
engagement, the greater die benefit frc a addidonal resources and more and/or better instruction. As 
Newmann (1992) has discussed, engagement is difficult to measure with survey questions. 
Nonetheless, student engagement is likely to be associated with ability group placement, with the 
result that the effects of additional resources and greater instructional quantity and quality should be 
higher in higher ability groups. 

A second hypothesis is that our measures of class resources and instruction have different 
meanings in the different ability groups. For example, an emphasis on "problem-solving" could refer 
to low-level formulaic objectives in lower-ability groups, but more complex cognitive processes in 
higher-ability groups. If this is the case, then we would again expect to fmd different effeciS in the 
different ability groups. 

A third hypothesis is that the effects of resources and instruction may show laonlinear effec'u^ 
due to the common phenomenon of diminishing returns to productive inputs. In contrast to diose 
typically modeled in the production Amotion which economists apply to the firm, diminished returns 
in ihe present case are more likely tied to students' bounded capacities or willingness to assimilate 
new matei'ial. In contrast to the other hypotheses, this suggests the effects of resource and instruction 
variables could be lower m higher-ability groups, where the levels of those variables are more likely 
to approach their productive limits. 

These hypotheses suggest a simple alternative formulation to equations (3) and (4), one where 
the effects of resources and instruction are allowed to vary across ability groups: 

Math Achievement^ = + i3i(Background)i + /SzCHigb Group)i -h 
^(Low Group )i -h |?4(Class Resources)i + iSjCInstruction)^ + 
^^(High Group x Class Resources)} + 
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l57(Low Group x Instruction)i + €5, 



(5) 



Science Achievement^ = + Pj(Backgroun4)i + i32(High GroupX + 



ft(Nongrouped)i + i3^(Class Resources^ + /JjOnstruction^ + 



/?4(High Group x Class Resources); 4- 



^StOLow Group X InstnicUonX + 



(6) 



We first re-estimated Models 2 and 3 with the full set of ability group-by-class variable 
interaction terms included. Most of these proved statistically insignificant, but a few showed effects 
v/hich introduce some qualifications to the linear, additive formulation. To simplify the results, we 
re-estimated the equations excluding all interaction terms with significance levels of p > .10. For 
present purposes, we provide only a verbal summary of this supplementary analyses; the fiill 
regression results are available from the authors. 

The results show some consistency between science and math, and some disjuncoires. The 
estimates for Science Model 2 show that Teaching Experience has a positive effect for high -group 
students, but is zero or slightly negative for the others . In Math Model 2, three variables had 
significant interactions. First, only high-group students whose teachers had math majors learned more 
than students whose teachers had non-math majors. Second, larger class size has a significant 
negative effect for high-group math students, but no effect for anyone else. Thirds teacher career 
push has a negative effect for low-group students, but no effect for anyone else. 

The results for Mode! 3 show three significant interactions in both subjects. In science. 
Emphasis on Inquiry has a large positive effect for both high- and middle-group students (b- 3, IS vs. 
1.02 in Table 3), but no net effect for non-grouped students (b= -3. 82; net effect- 3.15 - 3.82= .67 
ns). Homework Time has a positive effect for high-group students, but no effect for anyone else. 
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Finally^ Lecture Time has positive results for high- and low-group students, but no net effect for the 
nongrouped students. 

Model 3 for math shows interactions for problem-solving^ absenteeism^ and homework. 
Additional emphasis on problem-solving is effective for low- and middle- but not high-ability group 
classes. Absenteeism hurts only high-group students, and additional homework helps only high-group 
students. 

The coefficients in Tables 3 and 4 for the variables which did not have significant ability 
group interactions show no important changes with the addition of the interaction terms. Our 
summary assessment of the interaction results is that there is no clear pattern in the interactions, and 
the hypotheses we outlined have little support. The results introduce a few imponant qualifications, 
but raise further questions that we could only speculate on at this point: Why do nongrouped students 
receive no benefits from greater emphasis on inquiiy and more use of lectures in science? Why do 
high math group students show no benefits fram greater emphasis on problem solving, while low- and 
average-group students are helped? 

In any case, there are some difficult interpretive issues even for the the effects which were 
'X)nsistent with our hypotheses. One key problem is the ambiguity around whether these interactions 
represent ability-by-treatment effects versus effects of differences among groups in what a 
nominally-identical treatment actually involves. The former effect would niean, for example, that if a 
hilxed group of students did the same homework assignment, the slower students would get nothing 
out of it while the faster students learned something. The latter effect could ti)ean^ for example, that 
higher-group students benefit from more homework because their teachers assign them challenging 
problems; lower-group students, in contrast, learn nothing from their homework because the 
homework assigned is little more than busywork. 
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If the ability-by-treatxnent hypothesis is correct, then we might find significant 
ability-by-treaanent interactions within ability groups. To test this possibility we added pretest (eighth 
grade science or math score)-by-instruction multiplicative terms to the models which already included 
the group-by-instruction interactions. The regression results showed that aone of these new 
multiplicative terms had significant effects, and in no case did they explain away the significant 
group-by-insiniction interactions. Thus we conclude that the later interactions are niost likely 
reflections of different implementations of practices, ratlier than masking differences in students' 
capacities to benefit from those practices. 



SUMMARY AND CONCLUSIONS 
A large body of research has documented the importance of instructioimi grouping within 
schools for student academic success, but has stopped short of giving a systematic account of the 
differences among groups responsible for the different outcomes. The analysis pre{>ented here 
addressed this problem with recently collected survey data. These data are noteworthy in several 
respects: a national sample of middle schools and students, objective measures of ability group 
structure and student placements, reasonably reliable Outcome measures, and relatively detailed data 
from teachers about their course objectives and routines. These data were then analyzed in terms of a 
straightforward ordinary least squares regression framework, which included an adjustment for sample 
selection bias* 

The results provide limited support for the theoretical lines identified in previous research. At 
a descriptive level, Tables 1 and 2 showed several ways in which higher-track classes differ from 
lower-track classes, generally in directions which appear to favor stitdents in the high group. 
Inclusion of these variables in the regression equations explained about 75% of the effect of being in a 
high-ability science class on eighth-grade achievement growth. Most of the statistical explanation is 



tied to the greater encouragement, emphasis on ii^quiry, and use of team projects in high-ability 
science dashes. 

The explanatory power of the class resource and instruction variables in mathematics was 
much Jess, accounting for 33% of the high-ability group effect, and only 18% of the low-group effect 
on growth. Most of the explained part of the high-group effect is tied to the greater emphasis on 
problem-solving and tb^ lower emphasis on computation. Most of the expJained portion of the low- 
group (relative to the average group) effect is tied to the slower pace of coverage in low-group 
classes. 

Why does the model not account for all of the effects of ability grouping on achievement? At 
least three reasons seem possible. First, our controls for pre-existing conditions may not have 
addressed some aspects of selection into the different tracks that arc associate with achievement. In 
that case, apparent track effects that remain could reflect ditferences among students instead of 
variation in what occurs in die different tracks. We think this is the least likely explanation, first 
because our controls are extensive, including four measures of prior achievement, two measures of 
prior grades, and the standard battery of social background controls. Furthermore, the estimated 
reliabilities of our academic and social background control variables are similar to indicators used in 
other studies. 

Assuming the remaining track effects are real, a second reason our model does not e^^p'iain 
them AiUy may be that our measures do not capture all aspects of iostruction that differ across tracks. 
For example, the LSAY did not ask teachers or students which specific topics they covered. 
Moreover, aspects of instruction which are addressed in the data are measured imperfectly. We rely 
on teacher and smdent reports rather than on classroom observations » and we allow questions about 
instructional objectives to proxy for a range of important activities that may occur in class. More 



36 



39 




extensive information about instruction might allow us to explain a greater portion of the effects of 
tracking on achievement. 

Third, thete are undoubtedly other mechanisms at work, even if differential Instruction is the 
most important as we have argued here. Assignment to different tracks may affect the way students 
view themselves and their schoolwork, thus influenciog the amount of effort they put in, the way they 
behave in class, and the extent of their achievement (see Gamoran and Berends, 1987). In our model, 
the only indicator of effort was time spent on homework, and we included no measures of academic 
self-concepts or attitudes toward schoolwork. We indirectly tested for this specification error by 
assessing whether the effects of instructional variables were greater ia the higher ability groups, but 
found only limited and inconsistent support for that prediaion. 

Aside from their value in accounting for track effects, the effects of instructional conditions 
are interestii\g in their own rights. We observed several similarities in the patten>s of effects in the 
two subjects. In math, significant instructional effects on achievement were those of emphasizing 
problem-solving, textbook coverage, lecture time, and studeots' perceptions of the class as accessible 
to them. Taken at face value, these results suggest that a more conventional approach in which the 
teacher presents material to students is more effective than interactive or "student-centered" methods. 
Much mort; work is needed, particularly work that includes classroom observation, to illuminate these 
speculations. 

In science, teachers who devoted more time to small-group work produced higher 
achievement, whereas teachers who lectured more did not consistently have greater success. 
Textbook coverage also had no effects on achievement. Students fiirther benefitted when they found 
class materials accessible and when their teachers emphasized inquiry methods, although an emphasis 
on science in everyday life did not appear productive. Both the use of small groups and greater 
accessibility go against die grain of some traditional models of science pedagogy. These results 
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indicate that students team more when they work together on tasks as opposed to the traditional 
emphasis on individual work; the greater success associated with higher accessibility stands agamsc 
the increasingly-criticized view diat it is the solely the student's responsibility to make sense of the 
traditional canon. Again, these findings need to be supplemented by observational studies of 
classroom activities. 

Consistent with Oakes' (1990) report, we find evidence of considerable variation within 
schools in resources for and implementation of secondary math and science instruction. Some of this 
variation appears consequential fbr student achievement. Our models suggest that if instruction in 
average and (in math) low tracks were brought to the level of that in high tracks, less inequality 
within schools would emerge. Our supplementary analyses of interaction ei!ects place some 
qualifications on these inferences, fbr we did find that the benefits of a few variables do not hold in 
all ability groups. Generally we can say, however, that instruction which is effective on average-for 
example, inquiry methods in science, and more textbook coverage in math— is equally important fbr 
students in the different ahilitj' groups and at varying levels of prior performance within groups. If 
further work with observational data or more incisive survey questions confirms this conclusion, we 
could use high'track classes, which are currently more effective in both subjects, as models fbr 
redesigning instruction for all students. Lacking such information, we cannot yet be certain that this 
is the best course to follow. 

Similarly, our models for eighth grade science achievement also hold some interesting 
suggestions about whether heterogeneous instruction would be more effective than current tracking 
systems. Tbe science data showed diat students in heterogeneous classes achieved at levels similar 
(actually lower, but not significantly) to those in average classes and less than those in high-track 
classes, controlling for prior performance levels. The descriptive data showed that these classes tend, 
as one might expect, to show levels of class resource and instructional variables which are 
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intermediate between high and average classes within the ability grouped schools. Our supplemental 
analyses of interaction effects found that, however, that inquiry methods and lecture formats seem to 
work less weU in the nongroiqped contexts. This suggests that teachers of heterogeneous classes face 
special problems which require some different approaches. Hoffer's (1992b) comparison of ability- 
grouped and nongrouped mathematics classes in the LSAY data showed a similar pattern to science ir. 
terms of achievement outcomes, but LSAY does not have teacher data on a large enough number of 
heterogeneous classes to carry out the analyses pursued in this study. Hence, we need more 
information about the limits and possibilities of heterogeneous classes before we can say that mixed- 
ability teaching is less effective for some students than grouping by ability in middle school math and 
science. 
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Appendix A. Description of Variables Used in the Analysis 



Variable Name 



Description (LSAY codebook variable names in parentheses) 



1989 Math Score IRT score for composite test consisting of NAEP items (EMTHIRT). 
1989 Science Score IRT score for composite test consisting of NAEP items ^SCIIRT). 

Student Social Backg roimd 
Female Student gender ( I = female, 0 = male) (AA2^) r 

Black, Hispanic Smdent race (O-I) and ethnicity (O-I) 

SES Family composite socioeconomic status; constructed from parental education 

(maximum of MOTHED and FATHED), occupation (maximum of Duncan 
SEI variables BH281S, BH287S, DH267AS, and DH267BS), and household 
possession index (sum of BA15A, BA15B, BAI5C, BA15F, and BA15I: daily 
newspaper, a specific place to do homework, a typewriter, a room of your 
own, and a weekly news magazine). The three components were separately 
standardized, and the standard scores averaged to form an equally-weightad 
composite for each smdent (SES3). 
Parent Push Parent push for success in science (math) classes: Student report from year I. Sum of 
smdent's agreements with AA19F /AA19E ("My parents have always encouraged me 
to work hard on science /math"), AA190 /AA19N ("My parents expect me to do well 
in science /math"), and AAI9R /AA19Q ("My parents think science /math is 
important") (PSCPHI, PMHPHl). The alpha reliabilities are .71 fer the math scale 
and .73 for the science scale. 



Student Academic Background 



1987 Math Score 

1988 Math Score 

1987 Science Score 

1988 Science Score 

1986 Grades 

1987 Science Grades 
1987 Math Grades 



IRT composite score (missing values imputed) (AMTHIRT). 
IRT composite score (missing values imputed) (CMTHIRT). 
IRT composite score (missing values imputed) (ASCIIRT). 
IRT composite score (missing values imputed) (CSCIIRT). 
Smdent-reported composite CPA (missing values imputed) (AA33). 
Smdent-reported science grades (missing values imputed) (CA27C). 
Smdent-reported math grades (missing values imputed) (CA27B). 



Teacher Background 



Science (math) major 
Science (math) minor 

Science (math) credits 

Sci. (math) educ. credits 

Teaching Experience 



Teacher majored in field (0-1) (BE87BI, BE87B2, BE88B1, BE88B2). 
Teacher minored in field (0-1) (BE87C1, BE87C::2, BE88C1, 
BE88C2). 

Total # of graduate & undergrad. courses in field (TOTSCI, 
TOTMTH). 

Total # of grad. & undergrad. secondary education courses in field 
(TOTSSE, TOTSME). 

Years teaching expedience in field (science or mathematics) (BF2, 
BG2). 



Cla5;s Characteristics 
Class Size Total # of students in class (DI3C, DJ3C). 

Weekly Class Time Minutes of class per week (DI4, DJ4). 
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Spence Lab Additional lab period for science course (0-1) (DISA). 

Science Lab Time Minutes of additional science lab per week: (DI56). 



Teacher Expectations & Encouragement: Student Repons 
Academic Push Scierice (math) teacher pushes student to work hard: student reports. 

CoQStincted by summing individual student responses to the following spring 

1989 true-false-don't know (coded 1, 0, -1, respectively) questions about "My 

science (math) teacher this semester..." (alpha =.73): 

0 ... expects me to do my best all the time (DA2B, DA3B). 

0 ... encourages me to do extra work when I don't understand 

something (DA2C, DASH. 
0 ... expects me to work hard on science (math) (DA2D, DA3D). 
'O ... expects me to complete my homework every night (DA2E, DA3E) 
0 thinks it is very important that I do well in science (math) (DA2P, 

DA3P). 



Career Push Teacher pushes student toward scientific or mathematical career: student 

reports. Constructed by summing individual student responses to the 
following spring 1989 true-false-doa't know (coded 1, 0, -1^ respectively) 
questions about "My science (math) teacher this semester..." (alpha ^ 
0 ... has talked to me about the kind of job I might want to do (DA2G, 
DA3G). 

0 ... expects me to go to college pA2H, DA3H). 

0 ... has encouraged me to take all the science [math] I can get in 

school (DA2M, DA3M). 
0 ... has encouraged me to think about a career in math or science 

(DA2N, DA3N). 

Effectiveness Teacher is conscientious & effective: student reports. Construrted by 

averaging student responses to four tnie-false-not sure itenns (a = .71 for 
science, .73 for math): 'My science (math) teacher this semester..." 
0 ... really enjoys teaching science (math]. 
0 ... is a very good teacher. 

0 ... gives me extra help when I don't understand something. 
0 ... really seems to like me 



Instructional Objectives: Teacher Reports 

Emphasis on Inquiry Science course emphasizes problem solving & inquiry; average of 

science teachers' responses to the following four-point Likert scale 
items asking about the relative emphasis given to different objeaives 
in the particular class: Teaching of experimental logic and design 
(DIIOC), Developing problem solving/inquiry skills (DIlOE), 
Developing systematic observation skills (DIIOH), and Teaching 
applications of mathematics in science (DIIOI). The alpha coefficient 
for the scale estimated at the student level is .75. 

Science in Everyday Life Science course emphasizes the importance of science for everyday life, 

especially environmental issues. Constructed by averaging teachers' 
responses to items asking about die relative emphasis given to: 
Increase awareness of the importance of science in daily life (Dl lOG), 



Emphasis on Facts 



and Learning about applications of science to environmenti\l issues 
(DUOL). 

Science (math) course emphasis: Teaching science (math) facts and 
principles (DUOB, DJIOB). 
Emph. on Froblem-Solving Math course emphasis: Teaching math problem solving skills 

(PJIOD). 

Emphasis on Computation Math course emphasis: Developing computational skills (DJlOLv). 



Text Coverage 
Days Absent 
Time on Routines 
Time on Discipline 
Time on Mew Material 
Time on Review 
Time on Testing 
Homework Assigned 
Homework: Time 



Instructional Quantity: Teacher & Student Reports 



7o of text covered this year: Teacher report (DH, DJ7). 
Days absent this school year: Student report (DA 17). 
% class time: daily routines (Dill A, DJllA). 
% class time: getting students to behave (DIUB, DJUB). 
% class time: presenting new material (OIllC, DJllC). 
% class time: review or student practice of skills (DIl ID, DJl ID). 
% class time: testing and evaluation (DU IE, DJl IE). 
Hrs of homework/week assigned: Teacher report (DI22, DJ22). 
Hrs of homework/week: Student report (CASCU. DASCU; CAMTHJ. 
DAMTHJ). 



Instructional Quality; Teacher Reports 



Lecture Time 
Discussion Time 
Small Group Time 

Seatwork Time 

Individualized Time 
Teaching Machine Time 

Homework Returned 



Class hours/week: Lecturing to the class (DI9A, DJ9A). 

Class hours/week: Leading discussions (DI9B, DJ9B). 

Class hours/week: Student woric in small groups or laboratory (DI9C, 

DJ9C). 

Class hours/week: Having students do seatwork on homework, 
worlcbook, or text assignments (DI9D, DJ9D). 
Class hours/week: Providing individualized instruction (DI9E, DJ9E). 
Class hours/week: Having student use teaching machines or computer- 
assisted instruction (DI9F, DJ9F). 

% homework assignments correaed and returned to students (DI24, 
DJ24). 



Instructional Duality: Student Reports 



Class Accessibility 



Class materials & ideas are accessible: student reports. Constructed 
by averaging student responses to five questions about the student's 
current science and math courses (all items are coded to range from 
1 = "F" to 5 = "A/): 

o How much do you like the subjea matter of this course? A 

means you really like the subject; F means you hate it. 
o How clear is the teacher in explaining the material? A means 

very clear; F means not clear at all. 
o How useful do you think this course will be to you in your 

career? A means that it will be very useful; F means that it 

will be of no use. 

0 How clear is the textbook for this course? A means very clear; 
F means hard to understand. 
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0 How difficult or easy is this course for you? A means that it is 
very easy; F means that it is very difficult. 

Class Challenge Class is intellectually challenging: student report G3ASCIE, 

DAMTHE). Responses range from "'A\ it challenges you a lot/ to 
"'F^ it never challenges you/ coded to range from 1 = "?" to 
5 = "A/ 
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Appendix B. Effects of Ability Group Placements on Classroom Vaiiables. 

Tables I and 2 showed several differences between ability groups in the levels of the 
instructional variables, but did not answer the question of whether ability group placement has an 
effect on these variables. To answer that question, it is necessary to determine whether students with 
comparable academic backgrounds, but who are in different ability groups, differ in their classroom 
experiences. As we have noted, schools and teachers are likely to adapt instruction to the preparation 
and orientation of their students. It may be the case> then, that while group placement is associated 
with instructional differences (as shown in Tables I and 2), group placement has no independent 
effea on students' instructional experiences, once the effects of individual student background 
differences are taken into account. 

We address this issue by regressing the instructional variables on students* ability group 
placements, prior achievement and grades, and social background. The coefficients on the abiUty 
group variables in these regressions thus give the estimated effects of group placements on the 
students' instruaional experiences, controlling for academic and social background. The problem v/e 
address in this analysis should not be confiised with the related, but still distinct, organizational issue 
of why the instruction variables diCEer between classes . To answer that issue properly, one would 
estimate class-level equations, wherein variability in instruction is some function of ability group level 
and the distribution of student background characteristics in the class. Estimating such an equation 
would indicate whether class characteristics (instruction) are affected by the ability group of the ^ass, 
net of the distribution of the background characteristics of the students enrolled in the class. While 
that would be a useful analysis, limitations of the LSAY sample do not permit it, for there are not 
enough sampled students in niost classes to obtain reliable estimates of the background distributions in 
each class. Another way of thinking about the difference between our analysis and a class-level 
analysis is that we are trying to answer questions about the allocation of opportunities to students, 
while the later addresses an issue of the production of opportunities. 

The results of the regressions of the class variables on the group placement dummies and 
background controls are shown in Tables B*l and B-2. 
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Appendix B-1. Effects of Eighth-Grade Students' Science Ability Group Placement on Science Class Resource 
and Instnictional Variables: Metric Coefficients 





Nongrouped 


High Group 






minus 


minus 


Adjusted 


Dependent Variable 


Average 


Averaee 




Resources 
















Science M^or 


0.04 


-0.09* 


.05 


dcxenco iviuior 


_n CO 






Science Credits 


-2.48*** 


-2.66** 


.05 


Science Educ. Credits 


-3.15*** 


-3.15*** 


.19 


Teaching Hxperience 


4.91*** 


-1.46* 


.12 


Class Characteristics 














nil 


Class Time 


0.23 


7.82** 


.13 


Science Lab 


0.09** 


0.20*** 


.10 


Lab Time 


5.91** 


8.41** 


.05 


Teacher Expectations 
















Teacher Career Push 


0.03 


0.30*** 


.07 


Teacher Effectiveness 


-0.08** 


0.06 


.04 


Instruction 








Obiectives 








Emphasize Inquiry 


0.10** 


0.53*** 


.15 


Emphasis on Facts 


0.25*** 


0.23*** 


.05 


Emphasize Sci in Life 


-0.02 


0.46*** 


.12 


Quantity 








Text Coverage 


'3.55*** 


2.92* 


.07 


Days Absent 


-0.05 


-0.28** 


.11 


Time on Routine 


-0.33 


-1.29** 


.07 


Time on Discipline 


-0.72 


-2.42*** 


.03 


Time on New Mateiiais 


-3.91*** 


-2.98** 


.05 


Time on Review 


3.05*** 


4.43*** 


.06 


Time for Testing 


1.72*** 


1.72** 


.09 


Homework Assigned 


-0.46*** 


0.72*** 


.13 


Homework Hme 


0.02 


0.39** 


.02 


Quality 








Lecture Time 


-0.12** 


0.12 


.06 


Discussion Time 


-0.55*** 


0.54*** 


.26 


Small Group Time 


0.27*** 


0.23** 


.06 


Seatwork Time 


0.46*** 


-0.04 


.15 


Individualized Time 


0.29*** 


0.34*** 


.13 


Teaching Machine Time 


0.08*** 


-0.04* 


.04 


Homework Returned 


-0.77 


0.47 


.09 


Class Accessibility 


0.04 


-0,04 


.10 


Class Challenge 


0.06 


0.27** 


.03 



*p<.10 **p<.05 ***p < .001 

Note: Effects are estimated from student-level regressions of the science class resource and instruction 

variables on students' ability group placements (dummy variables for nongiouped and high group) and 
controls for seveuth- and eighth-grade science and math achievement, sixdi-grade composite grades, 
seventh-grade grades in science, gender, race-ethnicity, parental encouragement and expectations, and 
the predicted probability of being excluded from the analysis sample, 
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Appeadix B-2. Effects of Eigbth-Giade Students* Math Ability Group Placemeiits on Math Class Resource and 
Insttuctional Variables: Metric Coefficients 





Low Group 


High Qioup 






minus 


minus 


Adjusted 


Dependent Variable 


Average 


Average 




Resources 








Teacher Backfiround 








Math Major 


0,02 


0.23*** 


,05 


Math Minor 


-0,13** 


-0.13 


,05 


Math Credits 




2.26 


.07 


Math Educ. Credits 


0.41 


2.80*** 


.06 


Teaching Experience 


0.79 


-0.63 


.13 


Class Characteristics 








Class Size 


-4.31*** 


0.90* 


.13 


Class Time 


-8.09** 


0.34 


.06 


Teacher Expectations 








Teacher Academic Fush 


^.05* 


0.04 


.05 


Teacher Career Push 


-0.02 


0.19*** 


.06 


Teacher Effectiveness 


^.05 


0.12** 


.04 


Instnictton 








Objectives 








Emphasis on problem Solving 


^,00 


0.31*** 


.12 


Emphasis on Facts 


^.10** 


0.02 


.07 


Bmphflsis on Computation 


0.23*** 


^.30*** 


.15 


Ouantitv 








Text Coverage 


-3.02** 


1.29 


.14 


Days Absent 


0.01 


-0.17* 


.12 


Time on Routine 


^.43 


-1.13** 


.07 


Tir:ie on Discipline 


1.79** 


-4.52*** 


.12 


Time on Mew Materials 


-1.65 


7.37*** 


.13 


Time on Review 


1.65 


-0.54 


.04 


Time for Testing 


-i.n* 


-1.01* 


.02 


Homework Assigned 


-0.05 


1.19*** 


.14 


Homework Time 


0.09 


1.06*** 


.12 


Oualitv 




9 




Lecture Time 


-0.10* 


0.05 


.03 


Discussion Time 


^.25*** 


-0.03 


.05 


Small Group Time 


0.03 


-0.06 


.01 


Seatwoiic Time 


-0.10* 


-O.OO 


.09 


Individualized Time 


0.13** 


-0.16** 


.06 


Teaching Machine Time 


0.16** 


-0,03 


.12 


Homework Returned 


15.5*** 


2.5 


.04 


Class Accessibility 


0.18** 


-0.09 


.07 


Class Challenge 


0.05 


0.44*** 


.04 



* p < .10 *♦ p < .05 *♦* p < .001 

Mote: Effects are estimated firom studeat-Ievel regressions of the math class resource and instruction variables on 

seventh* and eighth-grade science and math achievement, sixth-grade composite grades, seveolb-grade grades in 
matht gender, race-ethnicity, parental eacouragemeat and expectations, and the predicted probability of being 
excluded from the analysis sample. 
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Class ^ Class 

Resources Functioning 




NOTE: Direct paths from Student Background, Ability Group placenaeat, and Class Resources 

to Achievement are estimated in the data analysis, but are not shown here for clarity 
of presentation. 



FIGURE K Schematic Model of Within-School Influences on Achievement. 
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Table 3. Estimated Effects of Group Placdment and Class Variables oq Eighth Grade Learning: Science 

(Standard errors in pareatheses) 



GROUP PLACEMENT 

NoQgiouped (vs. svezage) 
High (vs. average) 


Mnde] i 

•0.52 (0.49) 
1.56 (0.77)** 


MfldfiL2 

-0.57 (0.51) 
1.25 (0.78) 


Model 3 

-0.77 (0.57) 
0 39 (0 87^ 


RESOURCES 

Xeacher Background 
Science Major 
Teachinff Exnerience 




1.26 (0.47)** 
0 01 f0.03^ 


0.63 (0.55) 


Class Characteristics 
Class Size 
Science Lab 




•O.03 (0.04) 
0.75 (0.51) 


-0.04 (0.04) 
0.39 (0.56) 


Teacher Expectations 
Acadenuc Push 
Career Push 
Teacher Effectiveness 




-0.84 (0.60) 
0.79 (0.44)* 
0.47 (0.44) 


-0.90(0.61) 
0.81(0.44)'* 
0.06 (0.50) 


mSTRUCTION 

Objectives 

Emphasis on Inquiry 
Science in Everyday Life 
Emphasis on Pacts 






1.02(0.57)* 
-0.40 (0.42) 
0.58 (0.42) 


Quantity 

Text Coverage 

Days Absent 

Time on New Material 

Homework Time 






0.006(0.02) 
-0,38(0.18)** 
0,01 (0.02) 
■0.04 (0.17) 


OualitY 
Lecture Time 
Small-Group Time 
Discussion Time 
Class Accessibility 
Class CbflJlenge 






0.39 (0.41) 
0.99 (0.45)** 
0.30 (0.39) 
0.71(0.31)** 
-0.14 (0.19) 


Probability o f Exclusion 


•18.51 (4.47)** ^ 


-19.6 (4.58)** 


-18.54 (4.94)** 


ADJUSTED 


0.55 


0.55 


0.56 



*p < ,10, **p < .05. 

Note: All three equations include coQtrols for seventh- and eighth-grade science and math achievement, sixth- 
grade composite grades, seventh-grade grades in science, gender, race-cthnicity, and parental 
encouragement and expectations. « 
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Tabl« 4. Estimated Effects of Group Placement and Class Variables on Eighth Grade Learning: 

Matheaiatics (Standard Errors in Pareaiheses) 



Model I Model 2 Model 3 

GROUP PLACEMENT 

Low (vs. Average) -1.88 (0.57)** -2.07 (0.59)** -L55 (0.60)** 

High (vs. Average) 1.84 (0.57)** 1.75 (0.58)** 1.21 (0.61) 

RESOURCES 

Teachfif Background 

Math Major 0.33 (0.41) 0. 12 (0.43) 

Teachiiig Experience 0.04 (0.03) 0.04 (0.03) 

Class Characteristics 

Class Size -0.05 (0.04) -0.05 (0.04) 

Teacfaer Expectations 

Academic Push 0.16 (0.71) 0.01 (0.70) 

Career Push -0.26 (0.39) -0.26 (0.39) 

Teacher Effectiveness 1.05 (0.48)** 0.90 (0.50)* 

INSTRUCTION 
Objectives 

Emphasis on Problem-Solviiig 0.71 (0.35)** 

Emphasis on Facts & Principles 0.75 (0.39)* 

Emphasis on Computation -0.55 (0.31)* 

Quantity 

Text Coverage 0.07 (0.02)** 

Days Absent -0.24 (0. 18) 

Time on New Material 0.01 (0.01) 

Honaework Time ' 0.01 (0.12) 

Duality 

Lecture Time 0. 66 (0.35)* 

Small-Group Time 0.30 (0.38) 

Discussion Time 0.62 (0.41) 

Class Accessibility 0.58 (0.29)** 

Qass ChaUenge 0.03 (0.19) 

Probability of Exclusion -5.95 (1.75)** ^1.87 (1 .86)** -3.26 (1.91)* 

ADJUSTED ' 0.62 0.62 0.63 

*p < .10 **p < .05. 

Note: All three equations include controls for seventh- and eighth-grade science and math achievement, sixth- 
grade con^site grades r seventh-grade grades in science, gender, race-ethnicity, and parental 
encouragement and expectations. 



S3 



BEST COPY AVAIUBLE 



